Abstract

Signal-to-leakage-and-noise ratio (SLNR) is a promising criterion for linear precoder design in
multi-user (MU) multiple-input multiple-output (MIMO) systems. It decouples the precoder design
problem and makes a closed-form solution available. In this letter, we present a new linear precoding
scheme by slightly relaxing the SLNR maximization for MU-MIMO systems with multiple data streams
per user. The precoding matrices are obtained by a general form of simultaneous diagonalization of two
Hermitian matrices. The new scheme reduces the gap between the per-stream effective channel gains,
an inherent limitation in the original SLNR precoding scheme. Simulation results demonstrate that the
proposed precoding achieves considerable gains in error performance over the original one for
multi-stream transmission while maintaining almost the same achievable sum-rate.



Index Terms 

Signal-to-leakage-and-noise ratio (SLNR), linear precoding, multi-user MIMO. 

I. Introduction 

The significance of a downlink multi-user multiple-input multiple-output (MU-MIMO) system
is that it allows a base station (BS) to communicate with several co-channel mobile stations (MS)
simultaneously and thereby considerably increases the system throughput. To exploit this benefit, it
is essential to suppress co-channel interference (CCI). Among many CCI suppression schemes,
linear precoding has gained popularity because of its simple implementation and good
performance. To design the optimal linear MU-MIMO precoding scheme, it is often desirable to
maximize the output signal-to-interference-plus-noise ratio (SINR) for each user. However, this
problem is known to be challenging due to its coupled nature, and no closed-form solution is
available yet. A more tractable but suboptimal design is to enforce a zero-CCI requirement for
each user, as in block diagonalization (BD) [1] and coordinated beamforming (CB) [2].

In [3], the authors propose the signal-to-leakage-and-noise ratio (SLNR) as the
optimization metric for linear precoder design. This metric transforms a coupled optimization
problem into a completely decoupled one, for which a closed-form solution is available. Unlike
the BD approach, it does not impose a restriction on the number of transmit antennas at the BS.
Moreover, in contrast to the CB scheme, it is applicable to any number of users and data streams.
Specifically, the SLNR based linear precoding weights in [3] are obtained by the generalized
eigenvalue decomposition (GED) of the channel covariance matrix and the leakage
channel-plus-noise covariance matrix of each user. However, a drawback of this GED based precoding
scheme is that, when each user has multiple data streams, the effective channel gains of the
streams can be severely unbalanced. If power control or adaptive modulation and coding cannot
be applied, the overall error performance of each user will suffer a significant loss.

In this letter, we present a new linear precoding scheme based on the SLNR criterion for a 
downlink MU-MIMO system with multiple data streams per user. The design goal is to reduce 
the margin between the effective SINRs of multiple data streams. To do this, we slightly
relax the SLNR maximization (note that maximizing SLNR at the transmitter side
does not necessarily lead to output SINR maximization at each receiver). We thereby obtain
a general form of simultaneous diagonalization of two covariance matrices linked to the user's
channel and leakage-plus-noise, from which the new precoding matrices are derived.
We also present a simple and low-complexity algorithm to compute the precoding matrix for
each user. Simulation results confirm that, compared with the original scheme, our scheme
achieves sizable gains in error-rate performance for multi-stream transmission
while maintaining almost the same sum-rate performance.

Notations: $E(\cdot)$, $\mathrm{Tr}(\cdot)$, $(\cdot)^{-1}$, and $(\cdot)^H$ denote expectation, trace, inverse, and conjugate
transpose, respectively. $\|\cdot\|_F$ represents the Frobenius norm. $\mathbf{I}_N$ is the $N \times N$ identity matrix, and
$\mathrm{diag}(a_1, \ldots, a_N)$ is the diagonal matrix with element $a_n$ on the $n$-th diagonal. Besides, $\mathbb{C}^{M \times N}$
represents the set of $M \times N$ matrices in the complex field.

II. System Model 

We consider a downlink MU-MIMO system with $N$ transmit antennas and $M$ receive antennas
at each of the $K$ active users. Let $\mathbf{H}_k \in \mathbb{C}^{M \times N}$ denote the channel from the BS to MS
$k$, and let $\tilde{\mathbf{H}}_k = [\mathbf{H}_1^H, \ldots, \mathbf{H}_{k-1}^H, \mathbf{H}_{k+1}^H, \ldots, \mathbf{H}_K^H]^H \in \mathbb{C}^{(K-1)M \times N}$ represent the corresponding
concatenated leakage channel. A spatially uncorrelated flat Rayleigh fading channel is assumed.
The elements of $\mathbf{H}_k$ are modeled as independent and identically distributed complex Gaussian
variables with zero mean and unit variance. In addition, we assume that $\mathbf{H}_k$, and also $\tilde{\mathbf{H}}_k$, have full
rank with probability one. For a specific vector time, the transmitted vector symbol of user $k$ is
denoted as $\mathbf{s}_k \in \mathbb{C}^{L \times 1}$, where $L$ ($\le M$) is the number of data streams supported for user $k$ and
is assumed equal for all the users for simplicity. The vector symbol satisfies the power constraint
$E(\mathbf{s}_k \mathbf{s}_k^H) = \mathbf{I}_L$. Before entering the MIMO channel, the vector $\mathbf{s}_k$ is pre-multiplied by a
precoding matrix $\mathbf{F}_k \in \mathbb{C}^{N \times L}$. Here, power allocation and rate adaptation among data streams
could be applied. However, the required signal design or feedback support may be relatively complex, so
we restrict ourselves to precoding design in this work. Then, for a given user $k$, the received
signal vector can be written as

$$\mathbf{r}_k = \mathbf{H}_k \mathbf{F}_k \mathbf{s}_k + \mathbf{H}_k \sum_{i=1, i \ne k}^{K} \mathbf{F}_i \mathbf{s}_i + \mathbf{n}_k \tag{1}$$

in which the second term represents CCI and the third term is the additive white Gaussian noise
with $E(\mathbf{n}_k \mathbf{n}_k^H) = \sigma^2 \mathbf{I}_M$.
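
As an illustration of the signal model in (1), the following NumPy sketch draws one channel realization and forms the received vector of one user. It is only a sketch under stated assumptions: the helper names `crandn` and `leakage_channel`, the noise variance, the choice L = 2, the random placeholder precoders, and the Gaussian symbol vectors are illustrative and not taken from the letter.

```python
import numpy as np

rng = np.random.default_rng(0)
N, M, K, L = 8, 3, 2, 2   # N, M, K as in the Fig. 1 setup; L = 2 picked for illustration
sigma2 = 0.1              # noise variance (illustrative value)

def crandn(*shape):
    """i.i.d. complex Gaussian entries with zero mean and unit variance."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

def leakage_channel(H, k):
    """Stack the channels of all users except k into a ((K-1)M x N) matrix."""
    return np.vstack([H[i] for i in range(len(H)) if i != k])

H = [crandn(M, N) for _ in range(K)]   # one channel matrix per user

# Placeholder precoders (random, NOT an SLNR design), scaled so Tr(F_k F_k^H) = L.
F = []
for _ in range(K):
    Fk = crandn(N, L)
    F.append(Fk * np.sqrt(L / np.trace(Fk @ Fk.conj().T).real))

s = [crandn(L, 1) for _ in range(K)]   # unit-power symbol vectors (Gaussian stand-ins)

k = 0
cci = H[k] @ sum(F[i] @ s[i] for i in range(K) if i != k)   # second term of (1)
n_k = np.sqrt(sigma2) * crandn(M, 1)                         # third term of (1)
r_k = H[k] @ F[k] @ s[k] + cci + n_k                         # received vector of user k
H_leak = leakage_channel(H, k)
```

The leakage channel is simply a vertical stack of the other users' channels, so its row count grows as (K-1)M while the column count stays N.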

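We review the original SLNR based precoding scheme in [3]. Recall that the SLNR is defined
as the ratio of received signal power at the desired MS to received signal power at the other
terminals (the leakage) plus noise power without considering receive matrices, given by

$$\mathrm{SLNR}_k = \frac{\mathrm{Tr}\left(\mathbf{F}_k^H \mathbf{H}_k^H \mathbf{H}_k \mathbf{F}_k\right)}{\mathrm{Tr}\left(\mathbf{F}_k^H \left(\frac{M}{L}\sigma^2 \mathbf{I} + \tilde{\mathbf{H}}_k^H \tilde{\mathbf{H}}_k\right) \mathbf{F}_k\right)} \tag{2}$$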

for $k = 1, \ldots, K$. According to the SLNR criterion, the precoding matrix $\mathbf{F}_k$ is designed based
on the following metric

$$\mathbf{F}_k^{\mathrm{opt}} = \arg\max_{\mathbf{F}_k \in \mathbb{C}^{N \times L}} \mathrm{SLNR}_k \tag{3}$$

subject to the power constraint $\mathrm{Tr}(\mathbf{F}_k \mathbf{F}_k^H) = L$. Since $\mathbf{H}_k^H \mathbf{H}_k$ is Hermitian and positive
semidefinite (HPSD) and $\frac{M}{L}\sigma^2 \mathbf{I} + \tilde{\mathbf{H}}_k^H \tilde{\mathbf{H}}_k$ is Hermitian and positive definite (HPD), by generalized
eigenvalue decomposition, there exists an invertible matrix $\mathbf{T}_k \in \mathbb{C}^{N \times N}$ such that

$$\mathbf{T}_k^H \mathbf{H}_k^H \mathbf{H}_k \mathbf{T}_k = \mathbf{\Lambda}_k = \mathrm{diag}(\lambda_1, \ldots, \lambda_N) \tag{4}$$

$$\mathbf{T}_k^H \left(\frac{M}{L}\sigma^2 \mathbf{I} + \tilde{\mathbf{H}}_k^H \tilde{\mathbf{H}}_k\right) \mathbf{T}_k = \mathbf{I}_N \tag{5}$$

with $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_N \ge 0$. Here, the columns of $\mathbf{T}_k$ and the diagonal entries of $\mathbf{\Lambda}_k$ are the
generalized eigenvectors and eigenvalues of the pair $\{\mathbf{H}_k^H \mathbf{H}_k, \frac{M}{L}\sigma^2 \mathbf{I} + \tilde{\mathbf{H}}_k^H \tilde{\mathbf{H}}_k\}$, respectively.
It is then shown in [3] that the optimal precoder maximizing the objective function
(3) can be obtained by extracting the leading $L$ columns of $\mathbf{T}_k$ as

$$\mathbf{F}_k^{\mathrm{opt}} = \rho \mathbf{T}_k [\mathbf{I}_L; \mathbf{0}], \tag{6}$$

where $\rho$ is a scaling factor so that $\mathrm{Tr}(\mathbf{F}_k \mathbf{F}_k^H) = L$. The resulting maximum SLNR value is
given by $\mathrm{SLNR}_k^{\max} = \sum_{i=1}^{L} \lambda_i / L$. Along with the precoder, the matched-filter (MF)
type receive matrix, denoted as $\mathbf{G}_k = (\mathbf{H}_k \mathbf{F}_k)^H$, is applied at each user receiver, which makes
the streams of that user free of inter-stream interference. Note that better performance could be achieved if a multi-user
MMSE type receiver were adopted. In this letter, we still adopt the MF-type detector at the receiver as
in [3] for implementation simplicity and analytical convenience.
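
The GED-based construction in (4)-(6) can be sketched numerically as follows. This is a minimal NumPy illustration, not the implementation used in [3]: it obtains the generalized eigenvectors through a Cholesky factor of the leakage-plus-noise matrix, and the function name `slnr_ged_precoder`, the helper `crandn`, and all numeric values are assumptions made for the example.

```python
import numpy as np

def slnr_ged_precoder(Hk, Hk_leak, L, sigma2):
    """Precoder of (6): leading L generalized eigenvectors of the pair in (4)-(5)."""
    M, N = Hk.shape
    A = Hk.conj().T @ Hk                                           # HPSD matrix in (4)
    B = (M / L) * sigma2 * np.eye(N) + Hk_leak.conj().T @ Hk_leak  # HPD matrix in (5)
    G = np.linalg.cholesky(B)                 # B = G G^H
    Ginv = np.linalg.inv(G)
    W = Ginv @ A @ Ginv.conj().T              # Hermitian; its eigenvalues are the lambda_i
    W = (W + W.conj().T) / 2                  # remove round-off asymmetry
    lam, V = np.linalg.eigh(W)                # ascending order
    lam, V = lam[::-1], V[:, ::-1]            # descending, as required below (5)
    T = Ginv.conj().T @ V                     # T^H B T = I and T^H A T = diag(lam)
    F = T[:, :L]
    rho = np.sqrt(L / np.trace(F @ F.conj().T).real)   # enforce Tr(F F^H) = L
    return rho * F, T, lam, A, B

def slnr(F, A, B):
    """SLNR of (2) for a given precoder."""
    return (np.trace(F.conj().T @ A @ F) / np.trace(F.conj().T @ B @ F)).real

rng = np.random.default_rng(1)
crandn = lambda *sh: (rng.standard_normal(sh) + 1j * rng.standard_normal(sh)) / np.sqrt(2)
N, M, K, L, sigma2 = 8, 3, 2, 2, 0.1          # illustrative configuration
Hk, Hk_leak = crandn(M, N), crandn((K - 1) * M, N)

F_opt, T, lam, A, B = slnr_ged_precoder(Hk, Hk_leak, L, sigma2)
slnr_value = slnr(F_opt, A, B)                # should match sum(lam[:L]) / L
```

Because the SLNR in (2) is a ratio of two traces, the scaling factor cancels, so `slnr_value` depends only on the selected columns of `T`.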

A drawback of this GED based precoding scheme is that, when $L \ge 2$, the effective channel
gains of the streams can be severely unbalanced, as illustrated in Section III-C. It is
known that the overall performance of a user with multiple streams is dominated by the stream
with the worst channel condition. Hence, such channel imbalance leads to poor overall
error performance for a user. In the next section, we allow a slight relaxation of the SLNR
maximization, which provides additional degrees of freedom to design a new precoding scheme
that overcomes this drawback.

III. Proposed Precoding Scheme 
A. Design Principle by Matrix Theory 

The expressions in (4) and (5) by the GED approach motivate us to find a more general form
of simultaneous diagonalization of two matrices. Before introducing our results in Proposition
1, we review the following Lemma [4, Ch. 4, 4.5.8]:

Lemma 1: Let $\mathbf{A}, \mathbf{B} \in \mathbb{C}^{n \times n}$ be Hermitian. There is a non-singular matrix $\mathbf{S} \in \mathbb{C}^{n \times n}$ such
that $\mathbf{S}^H \mathbf{A} \mathbf{S} = \mathbf{B}$ if and only if $\mathbf{A}$ and $\mathbf{B}$ have the same inertia, that is, have the same number
of positive, negative, and zero eigenvalues.
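
A small numerical illustration of the "only if" direction of Lemma 1 is given below: a congruence with a non-singular matrix leaves the inertia unchanged. The helper `inertia`, its tolerance, and the example matrices are assumptions made for this sketch.

```python
import numpy as np

def inertia(X, tol=1e-9):
    """Return (#positive, #negative, #zero) eigenvalues of a Hermitian matrix."""
    w = np.linalg.eigvalsh((X + X.conj().T) / 2)
    return (int((w > tol).sum()), int((w < -tol).sum()), int((np.abs(w) <= tol).sum()))

rng = np.random.default_rng(1)
A = np.diag([2.0, 0.5, 0.0, -1.0]).astype(complex)   # Hermitian with inertia (2, 1, 1)
S = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))  # non-singular with probability one
B = S.conj().T @ A @ S                               # congruent to A

inertia_A = inertia(A)
inertia_B = inertia(B)   # expected to coincide with inertia_A by Lemma 1
```

The tolerance is needed because the zero eigenvalue of the congruent matrix is only zero up to round-off.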
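
Proposition 1: For the pair of matrices $\{\mathbf{H}_k^H \mathbf{H}_k, \frac{M}{L}\sigma^2 \mathbf{I} + \tilde{\mathbf{H}}_k^H \tilde{\mathbf{H}}_k\}$, there is a non-singular
matrix $\mathbf{P}_k \in \mathbb{C}^{N \times N}$ such that

$$\mathbf{P}_k^H \mathbf{H}_k^H \mathbf{H}_k \mathbf{P}_k = \mathbf{\Theta}_k \tag{7}$$

$$\mathbf{P}_k^H \left(\frac{M}{L}\sigma^2 \mathbf{I} + \tilde{\mathbf{H}}_k^H \tilde{\mathbf{H}}_k\right) \mathbf{P}_k = \mathbf{\Omega}_k \tag{8}$$

in which $\mathbf{\Theta}_k = \mathrm{diag}(\theta_1, \theta_2, \ldots, \theta_N)$ and $\mathbf{\Omega}_k = \mathrm{diag}(\omega_1, \omega_2, \ldots, \omega_N)$ with the entries satisfying
$1 > \theta_1 \ge \cdots \ge \theta_M > 0$, $\theta_{M+1} = \cdots = \theta_N = 0$ and $0 < \omega_1 \le \cdots \le \omega_M < 1$, $\omega_{M+1} = \cdots =
\omega_N = 1$, as well as $\theta_i + \omega_i = 1$ for $i = 1, 2, \ldots, N$.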

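Proof: Denote $\mathbf{A}_k = \mathbf{H}_k^H \mathbf{H}_k$, $\mathbf{B}_k = \frac{M}{L}\sigma^2 \mathbf{I} + \tilde{\mathbf{H}}_k^H \tilde{\mathbf{H}}_k$ and $\mathbf{C}_k = \mathbf{A}_k + \mathbf{B}_k$. Let the
eigenvalues $\lambda_i(\mathbf{A}_k)$, $\lambda_i(\mathbf{B}_k)$ and $\lambda_i(\mathbf{C}_k)$, $i = 1, 2, \ldots, N$, be arranged in increasing order.
Since $\mathbf{A}_k$ is HPSD and $\mathbf{B}_k$ is HPD, namely, $\lambda_i(\mathbf{A}_k) \ge 0$ and $\lambda_i(\mathbf{B}_k) > 0$, then by [4, 4.3.1], we
have $\lambda_i(\mathbf{C}_k) \ge \lambda_i(\mathbf{A}_k) + \lambda_1(\mathbf{B}_k) > 0$, $\forall i$. This implies that $\mathbf{C}_k$ is HPD. Then, by the matrix
theory in [4, 4.5.8, Exercise], there must be a non-singular matrix $\mathbf{Q}_k \in \mathbb{C}^{N \times N}$ such that

$$\mathbf{Q}_k^H \mathbf{C}_k \mathbf{Q}_k = \mathbf{Q}_k^H (\mathbf{A}_k + \mathbf{B}_k) \mathbf{Q}_k = \mathbf{I}_N. \tag{9}$$

Further, denote $\mathbf{A}'_k = \mathbf{Q}_k^H \mathbf{A}_k \mathbf{Q}_k$ and $\mathbf{B}'_k = \mathbf{Q}_k^H \mathbf{B}_k \mathbf{Q}_k$. By Lemma 1, $\mathbf{A}'_k$
and $\mathbf{B}'_k$ have the same inertia as $\mathbf{A}_k$ and $\mathbf{B}_k$, respectively. Thus, $\mathbf{A}'_k$ is HPSD and $\mathbf{B}'_k$ is HPD.
Now, by using [4, 4.3.1] again together with (9), it is easy to show that $1 > \lambda_i(\mathbf{A}'_k) \ge 0$ and $1 \ge \lambda_i(\mathbf{B}'_k) > 0$.
Next, according to the eigen-decomposition (ED) of a Hermitian matrix [4], there must be a
unitary matrix $\mathbf{U}_k \in \mathbb{C}^{N \times N}$ such that

$$\mathbf{U}_k^H \mathbf{A}'_k \mathbf{U}_k = \mathrm{diag}\left(\lambda_1(\mathbf{A}'_k), \ldots, \lambda_N(\mathbf{A}'_k)\right). \tag{10}$$

Applying $\mathbf{U}_k$ on both sides of (9), we obtain

$$\mathbf{U}_k^H (\mathbf{A}'_k + \mathbf{B}'_k) \mathbf{U}_k = \mathbf{I}_N. \tag{11}$$

Hence, comparing (10) and (11), we find that $\mathbf{U}_k^H \mathbf{B}'_k \mathbf{U}_k$ must satisfy $\mathbf{U}_k^H \mathbf{B}'_k \mathbf{U}_k =
\mathrm{diag}\left(1 - \lambda_1(\mathbf{A}'_k), \ldots, 1 - \lambda_N(\mathbf{A}'_k)\right)$. Clearly, as $\mathbf{U}_k$ is unitary, $\{1 - \lambda_i(\mathbf{A}'_k)\}_{i=1}^{N}$ must
be the eigenvalues of $\mathbf{B}'_k$. To this end, we define $\mathbf{P}_k = \mathbf{Q}_k \mathbf{U}_k$. Since $\mathrm{rank}(\mathbf{H}_k^H \mathbf{H}_k) = \mathrm{rank}(\mathbf{H}_k) =
M$ and the rank is unchanged upon left or right multiplication by a non-singular matrix, exactly $M$ of the
eigenvalues of $\mathbf{A}'_k$ are nonzero. Ordering the columns of $\mathbf{U}_k$ so that the diagonal entries in (10) are
non-increasing, we arrive at the results in (7) and (8). ∎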
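
Algorithm 1 The specific design of precoder $\mathbf{F}'_k$ for user $k$
Input: $\mathbf{A}_k = \mathbf{H}_k^H \mathbf{H}_k$ and $\mathbf{C}_k = \mathbf{H}_k^H \mathbf{H}_k + \frac{M}{L}\sigma^2 \mathbf{I} + \tilde{\mathbf{H}}_k^H \tilde{\mathbf{H}}_k$.

1) Compute the Cholesky decomposition of $\mathbf{C}_k$ as $\mathbf{C}_k = \mathbf{G}_k \mathbf{G}_k^H$, where $\mathbf{G}_k \in \mathbb{C}^{N \times N}$ is a lower triangular matrix with positive
diagonal entries (within this algorithm, $\mathbf{G}_k$ denotes the Cholesky factor, not the receive matrix of Section II). Then $\mathbf{G}_k^{-1}$ can be easily obtained and we have $(\mathbf{G}_k^{-1})^H = \mathbf{Q}_k$ in (9).

2) Compute $\mathbf{A}'_k = \mathbf{Q}_k^H \mathbf{A}_k \mathbf{Q}_k$, then compute the ED of $\mathbf{A}'_k$ as $\mathbf{A}'_k \mathbf{U}_k = \mathbf{U}_k \mathbf{\Theta}_k$, with $\mathbf{\Theta}_k$ the diagonal matrix in (7). Note that $\mathbf{U}_k$ must be unitary; it can also be
obtained as the left singular matrix of $\mathbf{A}'_k$ via the SVD.

3) Compute $\mathbf{P}_k = \mathbf{Q}_k \mathbf{U}_k$.
Output: $\mathbf{F}'_k = \gamma \mathbf{P}_k [\mathbf{I}_L; \mathbf{0}]$.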
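
The three steps of Algorithm 1 map directly onto standard linear-algebra routines. The following NumPy sketch is one possible rendering under stated assumptions: the function name `proposed_precoder`, the helper `crandn`, and the numeric configuration are illustrative, and the eigenvalues are explicitly sorted in descending order to match the ordering in Proposition 1.

```python
import numpy as np

def proposed_precoder(Hk, Hk_leak, L, sigma2):
    """Sketch of Algorithm 1: returns the precoder, P_k, theta and the pair (A_k, B_k)."""
    M, N = Hk.shape
    A = Hk.conj().T @ Hk
    B = (M / L) * sigma2 * np.eye(N) + Hk_leak.conj().T @ Hk_leak
    C = A + B
    # Step 1: Cholesky factor of C_k, then Q_k = (G_k^{-1})^H so that Q^H C Q = I.
    G = np.linalg.cholesky(C)
    Q = np.linalg.inv(G).conj().T
    # Step 2: eigen-decomposition of A'_k = Q^H A Q (Hermitian).
    A1 = Q.conj().T @ A @ Q
    A1 = (A1 + A1.conj().T) / 2               # remove round-off asymmetry
    theta, U = np.linalg.eigh(A1)             # ascending order
    theta, U = theta[::-1], U[:, ::-1]        # descending, as in Proposition 1
    # Step 3: P_k = Q_k U_k; the output keeps the leading L columns.
    P = Q @ U
    F = P[:, :L]
    gamma = np.sqrt(L / np.trace(F @ F.conj().T).real)  # Tr(F' F'^H) = L
    return gamma * F, P, theta, A, B

rng = np.random.default_rng(2)
crandn = lambda *sh: (rng.standard_normal(sh) + 1j * rng.standard_normal(sh)) / np.sqrt(2)
N, M, K, L, sigma2 = 8, 3, 2, 2, 0.1          # illustrative configuration
Hk, Hk_leak = crandn(M, N), crandn((K - 1) * M, N)

F_new, P, theta, A, B = proposed_precoder(Hk, Hk_leak, L, sigma2)
omega = 1.0 - theta                            # diagonal of Omega_k in (8)
G_eff = (Hk @ F_new).conj().T @ Hk @ F_new     # matched-filter output gain matrix
```

With the matched-filter receiver, `G_eff` should come out diagonal, with entries proportional to the leading L values of `theta`.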



B. Precoder Design 

The general form of simultaneous diagonalization stated in Proposition 1 differs significantly
from the original GED based result in (4) and (5). This allows us to design a
new precoding scheme. Specifically, the proposed precoder $\mathbf{F}'_k$ and matched decoder $\mathbf{G}'_k$ can be
designed as

$$\mathbf{F}'_k = \gamma \mathbf{P}_k [\mathbf{I}_L; \mathbf{0}], \quad \mathbf{G}'_k = (\mathbf{H}_k \mathbf{F}'_k)^H \tag{12}$$

in which $\gamma$ is a normalization factor so that $\mathrm{Tr}(\mathbf{F}'_k \mathbf{F}'^H_k) = L$. By (7), $\mathbf{G}'_k \mathbf{H}_k \mathbf{F}'_k$ is
a diagonal matrix, so the streams of user $k$ are again free of inter-stream interference.

The remaining problem is how to compute a specific precoder $\mathbf{F}'_k$ for each user. Based on our
proof of Proposition 1, we present a closed-form expression using a simple and low-complexity
algorithm, outlined in Algorithm 1. In the next subsection, we show the advantage of the
proposed precoding scheme through a per-stream SINR discussion.

C. Performance Discussion 

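Firstly, continuing to use the same symbols $\mathbf{A}_k$ and $\mathbf{B}_k$ as in the proof of Proposition 1, we
can show that $\mathbf{A}_k \mathbf{t}_{ki} = \lambda_i \mathbf{B}_k \mathbf{t}_{ki}$ and $\mathbf{A}_k \mathbf{p}_{ki} = (\theta_i/\omega_i) \mathbf{B}_k \mathbf{p}_{ki}$ from (4)-(5) and (7)-(8), in which $\mathbf{t}_{ki}$ and
$\mathbf{p}_{ki}$ correspond to the $i$-th column of $\mathbf{T}_k$ and $\mathbf{P}_k$, respectively. Here, both $\lambda_i$ and $\theta_i/\omega_i$ must be
the generalized eigenvalues of the pair $\{\mathbf{A}_k, \mathbf{B}_k\}$. It is then easy to see that

$$\lambda_j = \theta_j/\omega_j, \quad j = 1, 2, \ldots, N \tag{13}$$

with $\{\lambda_j\}_{j=1}^{N}$ and $\{\theta_j\}_{j=1}^{N}$ sorted in descending order and $\{\omega_j\}_{j=1}^{N}$ sorted in ascending
order. Now we have $\mathrm{SLNR}_k = \left(\sum_{i=1}^{L} \theta_i\right) / \left(\sum_{i=1}^{L} (1 - \theta_i)\right)$, which is slightly smaller than
$\mathrm{SLNR}_k^{\max}$ given in Section II.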
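
The relation in (13) and the resulting SLNR comparison can be checked numerically. The sketch below is illustrative only: the helper `gen_eigvals` (generalized eigenvalues through a Cholesky factor), the helper `crandn`, and the configuration values are assumptions. It relies on the fact that the $\theta_j$ are the generalized eigenvalues of the pair formed by the channel covariance and its sum with the leakage-plus-noise covariance.

```python
import numpy as np

def gen_eigvals(A, B):
    """Generalized eigenvalues of the Hermitian pair (A, B), B positive definite, descending."""
    Ginv = np.linalg.inv(np.linalg.cholesky(B))   # B = G G^H
    W = Ginv @ A @ Ginv.conj().T
    return np.linalg.eigvalsh((W + W.conj().T) / 2)[::-1]

rng = np.random.default_rng(3)
crandn = lambda *sh: (rng.standard_normal(sh) + 1j * rng.standard_normal(sh)) / np.sqrt(2)
N, M, K, L, sigma2 = 8, 3, 2, 2, 0.1              # illustrative configuration
Hk, Hk_leak = crandn(M, N), crandn((K - 1) * M, N)

A = Hk.conj().T @ Hk
B = (M / L) * sigma2 * np.eye(N) + Hk_leak.conj().T @ Hk_leak

lam = gen_eigvals(A, B)          # lambda_j of the original scheme, see (4)-(5)
theta = gen_eigvals(A, A + B)    # theta_j of the proposed scheme, see (7) and (9)
omega = 1.0 - theta              # omega_j, ascending

slnr_max = lam[:L].sum() / L                     # original scheme
slnr_new = theta[:L].sum() / omega[:L].sum()     # proposed scheme
```

Since `slnr_new` is a weighted average of the leading `lam` values with weights proportional to `omega`, and larger `lam` values carry smaller weights, it cannot exceed the plain average `slnr_max`.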

On the other hand, the ultimate performance is decided by the post-detection SINR. Clearly, the decoded
signal takes the form

$$\hat{\mathbf{s}}_k = \mathbf{G}'_k \mathbf{H}_k \mathbf{F}'_k \mathbf{s}_k + \mathbf{G}'_k \left(\mathbf{H}_k \sum_{i=1, i \ne k}^{K} \mathbf{F}'_i \mathbf{s}_i + \mathbf{n}_k\right). \tag{14}$$

Thanks to the diagonal form in (7) and (8), the covariance matrix of the noise vector is given by
$E(\mathbf{G}'_k \mathbf{n}_k \mathbf{n}_k^H \mathbf{G}'^H_k) = \sigma^2 E(\mathbf{G}'_k \mathbf{G}'^H_k) = \gamma^2 \sigma^2 \mathrm{diag}(\theta_1, \ldots, \theta_L)$. Furthermore, it can be verified
through numerical results (though it is difficult to show analytically) that the residual CCI is much
smaller than the noise power at high SNR. As such, the SINR on the $l$-th stream, $\eta'_l$, can be
approximately calculated as $\eta'_l = (\gamma^4 \theta_l^2) / (\gamma^2 \sigma^2 \theta_l) = \gamma^2 \theta_l / \sigma^2$. Then, for any two streams $l$ and
$m$ with $l < m$, so that stream $l$ is the stronger one under the ordering in Proposition 1, the margin $\Delta'_{l,m}$ between $\eta'_l$ and $\eta'_m$ in decibels (dB) can be expressed
as

$$\Delta'_{l,m} = 10 \log_{10}(\eta'_l / \eta'_m) = 10 \log_{10}(\theta_l / \theta_m). \tag{15}$$

Following the same analysis, the margin $\Delta_{l,m}$ for the original scheme can be analogously
calculated as

$$\Delta_{l,m} = 10 \log_{10}(\eta_l / \eta_m) = 10 \log_{10}(\lambda_l / \lambda_m). \tag{16}$$

According to (13), we have $\lambda_l / \lambda_m = (\theta_l \omega_m) / (\theta_m \omega_l)$. Further, $\omega_m \ge \omega_l$ for $l < m$
by definition. This ensures that the following inequality holds:

$$\Delta'_{l,m} \le \Delta_{l,m}. \tag{17}$$

This explicitly shows that the SINR margin between any two streams decreases by applying the
proposed scheme. In other words, the effective channel gains of the multiple streams are
now less unbalanced. Its effectiveness will be further examined by simulation in the next section.
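
To make the margin comparison concrete, the sketch below evaluates the gap in dB between the strongest and the weakest stream of one user for a single random channel realization, for both schemes. It is an illustration under assumptions, not a reproduction of the simulation in the next section: the helper names `gen_eigvals` and `crandn`, the noise variance, and the random seed are illustrative, and the residual CCI is neglected as in the high-SNR approximation above.

```python
import numpy as np

def gen_eigvals(A, B):
    """Generalized eigenvalues of the Hermitian pair (A, B), B positive definite, descending."""
    Ginv = np.linalg.inv(np.linalg.cholesky(B))   # B = G G^H
    W = Ginv @ A @ Ginv.conj().T
    return np.linalg.eigvalsh((W + W.conj().T) / 2)[::-1]

rng = np.random.default_rng(4)
crandn = lambda *sh: (rng.standard_normal(sh) + 1j * rng.standard_normal(sh)) / np.sqrt(2)
N, M, K, L, sigma2 = 8, 3, 2, 3, 0.01             # N, M, K as in Fig. 1; sigma2 is illustrative
Hk, Hk_leak = crandn(M, N), crandn((K - 1) * M, N)

A = Hk.conj().T @ Hk
B = (M / L) * sigma2 * np.eye(N) + Hk_leak.conj().T @ Hk_leak

lam = gen_eigvals(A, B)        # per-stream gains of the original scheme
theta = gen_eigvals(A, A + B)  # per-stream gains of the proposed scheme

# Margin between the strongest (first) and the weakest (L-th) stream, in dB.
margin_orig_db = 10 * np.log10(lam[0] / lam[L - 1])      # as in (16)
margin_new_db = 10 * np.log10(theta[0] / theta[L - 1])   # as in (15)
```

Because each `lam` value equals the matching `theta` value divided by one minus that `theta`, the stronger stream is inflated more than the weaker one, which is why `margin_orig_db` is never smaller than `margin_new_db`.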

IV. Simulation Results 

Fig. 1 compares the simulated bit error rate (BER) per user in a MU-MIMO system with
different system configurations. Here, P denotes the proposed precoding scheme and O denotes
the original scheme in [3]. QPSK modulation with Gray mapping is employed and the BER
curves are plotted versus the transmit SNR ($L/\sigma^2$). In the single-stream case ($L = 1$), the proposed
scheme and the original scheme achieve the same BER performance. For multiple
streams ($L = 2$ and $3$), the former outperforms the latter with sizable gains. Specifically, gains
of around 2 dB and 4 dB are achieved at BER $= 10^{-4}$ for $L = 2$ and $L = 3$ streams,
respectively. We also carried out an achievable sum-rate comparison and found that the sum-rate of our scheme
is almost the same as that of the original one. The results are omitted due to the page limit.

Fig. 1. Uncoded BER of a MU-MIMO system with $N = 8$ transmit antennas at the BS and $K = 2$ users each with $M = 3$
receive antennas (horizontal axis: SNR (dB)).

The above simulation results verify the effectiveness of the proposed precoding scheme over 
the original SLNR based scheme when there are multiple data streams for each user. 